Skip to content

[feat] add monkey patch for gsa on device v0.9.2#618

Merged
ygwpz merged 3 commits intoModelEngine-Group:developfrom
Clarence-1103:dev_kvcomp_hbm_1223
Jan 22, 2026
Merged

[feat] add monkey patch for gsa on device v0.9.2#618
ygwpz merged 3 commits intoModelEngine-Group:developfrom
Clarence-1103:dev_kvcomp_hbm_1223

Conversation

@Clarence-1103
Copy link
Copy Markdown
Contributor

@Clarence-1103 Clarence-1103 commented Jan 4, 2026

Purpose

What this PR does / why we need it?

Add a monkey patch for gsa on device to enable this module to be used for profile analysis.

Modifications

Does this PR introduce any user-facing change?

Add a monkey patch for gsa on device to enable this module to be used for profile analysis.

Test

How was this patch tested?

export MODEL_PATH="/home/models/DeepSeek-V2-Lite-Chat"
export VLLM_HASH_ATTENTION=1
python examples/offline_inference_gsa_on_device.py
image

export MODEL_PATH="/home/models/Qwen3-32B"
export VLLM_HASH_ATTENTION=1
python examples/offline_inference_gsa_on_device.py
image

Comment thread ucm/integration/vllm/patch/patch_funcs/v092/vllm_patch.py
@yuanzhg078
Copy link
Copy Markdown
Contributor

You should clearly specify which test you executed.

@Clarence-1103 Clarence-1103 changed the title [feat] add monkey patch for kvcomp on device [feat] add monkey patch for gsa on device v0.9.2 Jan 22, 2026
@Clarence-1103
Copy link
Copy Markdown
Contributor Author

You should clearly specify which test you executed.

describe how was this patch tested

Infinite666
Infinite666 previously approved these changes Jan 22, 2026
Comment thread ucm/__init__.py
@ygwpz ygwpz merged commit 015a128 into ModelEngine-Group:develop Jan 22, 2026
6 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants